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An algorithm is developed to control a class of rearrangeable switch- 
ing networks, particularly with the base-2 structure. Various methods 
of implementing this algorithm are also described. System organiza- 
tion and processing time for rearranging the network are studied and 
are shown to be practical. 

I. INTRODUCTION 

One type of a switching network which has drawn considerable in- 
terest lately is the class of rearrangeable switching networks (RSN). 
With these networks, any idle input terminal of the network can al- 
ways bo connected to any idle output terminal by rerouting the ex- 
isting connections if necessary. These networks can be used where one- 
to-one full access and nonblocking features are required, and rerouting 
is feasible, e.g., main distribution frames 1 and facility switches 2 in 
telephone systems and data transfer networks in a multiprocessor 
computer system. 3 

Most of the earlier efforts, notably by C. Clos, 4 V. E. Benes, 5 and 
A. E. Joel, Jr., have been made in the context of telephone switching 
networks. Their emphasis has been on the network structure, on its 
combinatorial properties and on bounds on the number of connections 
that require rerouting. Recently, this type of network has been of 
interest in such computer areas as data-sorting systems 7 and self- 
repairing multiprocessors. 3 The network structure is also applicable for 
cellular arrays." However, very few reports" 10 have been made on the 
control aspect of these networks. 

This paper will begin with a brief discussion of the general struc- 
ture of RSN's, followed by the development of a method for the con- 
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trol of these networks and its practical implementation. The relation- 
ship between the network structure and the ease (or difficulty) with 
which it can be controlled will also be discussed. 

II. THE NETWORK STRUCTURE 

Discussion will be limited to a class of rearrangeable switching net- 
works connecting N input terminals and N output terminals [abbre- 
viated as (N X N) networks]. Extension to the more general case for 
(N X M) networks, N ¥=M, can be readily made. 

Let N = dq, where d and q are integer factors of N. A (N X N) 
network can be decomposed into an input stage and an output stage 
having altogether 2N/d (d X d) networks (one of which can be elimi- 
nated) and a middle stage having d {N/d X N/d) networks, as shown 
in Fig. 1. This network is said to have a base-rf structure, and the 
smaller networks are called subnetworks. This type of network struc- 
ture falls into the general class considered by Clos and Benes. 

The network with base-2 structure is of great importance, for two 
primary reasons. First, it yields the most efficient network lor the 
least number of two-state switching elements) and, secondly, its con- 
trol is relatively simple. It consists of (2 X 2) networks in the input 
and output stages, altogether (N — 1) in number where N is even or 
odd. Two (N/2 X N/2) networks are in the middle stage if N is even, 
or one [ (N - l)/2 X (A r - l)/2] network and one [ (JV + l)/2 X (N + 
l)/2] network are in the middle stage if N is odd, as shown in Fig. 2a 
and b. Clearly, further decomposition of the networks in the middle 
stage is possible, and if the base-2 structure is carried throughout, one 
may show by an iterative process that the total number of basic switch- 




Fig. 1 — Rearrangeable (N X N) network of a general base-d structure. 
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Fig. 2— An (N X N) network with base-2 structure for: (a) N even; (b) N odd. 



ing elements or (2 x 2) networks, named ^-elements by Joel, 6 is given 
by* 

N(\og 2 N) - 2 <U "' N) + 1. 
It can easily be seen that the number of these ^-elements is bounded 
by <log 2 iV!>. A (11 X 11 ) network consisting entirely of .^-elements is 
shown in Fig. 3. It is a very efficient network, since there are 29 such 
elements and one must have 26 (>logo(ll !) >25) two-state devices to 
accommodate all possible permutations. Some of the enumeration 
studies given in Part II will account for the additional states. 

III. THE NETWORK CONTROL 

The control algorithm is first developed for the general (N x N) 
network, having a base-rf structure, as it is shown in Fig. 1. The 
special case where d = 2 will then be considered, and practical imple- 
mentations of the control algorithm will be given in Section IV. First, 
some definitions are needed. 



* (2) \a the smallest integer greater than z. 
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Fig. 3 — An (11 X 11) network with base-2 structure. 



3.1 Definitions 

(i) An input-output pair [,*,,] defines a connection between input 
terminal x and output terminal t(x) of a. (N X N) network, where 
1 £ X £ N, 1 ^ r(x) ^ N 

(ii) A connection set C is a collection of m such input-output pairs, 
m ^ N, expressed as follows: 

C = 

Wfa) t(x 2 ) • • • ir(x„)j 

If m = N, C is then in the form of the familiar permutation. This may 
be denoted by P, with C C P, describing all the connections (or the 
traffic pattern) through the (N X N) network. 

(Hi) Let I = {1,2, • • • , N] be a set of positive distinct integers; 
a subset J (I, d) is defined by 

J(l, d) = L | [ a + ^ = - ]* = A , and aEI 

where d divides N, and £ is some constant integer, 1 ^ I ^ A/d. 
,/(£, d) is called an integer set of order d and of characteristic I, and 
any element belonging to it is said to have characteristic I relative to 
the base-d. This is merely a formal way of grouping all those termi- 
nals associated with the same subnetwork in the input (or output) 
stage. 

(iv) A connection set C having m input-output pairs is said to be 



* [m>] denotes the integral value of w. 
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reducible if and only if on replacing every integer by its characteristic, 
C becomes a permutation on m distinct integers. 

3.2 The Generalized Control Algorithm 

Any given set of connections through the network can be described 
as: 

p / x x x a ■•■ X N \ 

VCr,) tt(i- 2 ) • • • Tr(x.v)/ 

The objective of the control algorithm is to derive from P permutations 
to be realized by each of the subnetworks. This is accomplished by 
first decomposing P into reducible connection sets Ci , C 2 , ■ • • , C d . 
Permutations for the middle subnetworks are defined by using the 
characteristics of the elements in these sets, and permutations for the 
input and output stages are determined directly from these elements. 

3.2.1 To Decompose a Permutation into Reducible Connection Sets 
Let the output terminals be partitioned into sets denoted by S t where 

8, = Kr.) \x,eJ(l,d)}, l&l**' 

The reducible connection sets C, , 1 ^ i ^ d, are constructed by group- 
ing N/d input-output pairs of which the output integers are selected, 
one from each S, , such that no two output integers have the same 
characteristic. 



Let Ci be expressed as 
C, = 



.r,,, X; i2 • • • Xi.N/d 
v(Xi,i) ir(x it2 ) ••• ir(x iiN/d )\ 



1 £ t £ d; 



when it is reduced, one has the permutation 

Pi.l Pi. 2 ••• Pi.N/d 



Pi = . 

7r(p..i) *(Pi.i) *(Pi.N/d) 

where p,,, and 7r(p,.,) are the characteristics of the integers x,.,- and 
ir(x ,.,), respectively, 1 ^ i £ d, 1 £ j ^ AT/d. Each of these d permuta- 
tions Pi can be realized by any of the d (N/d X N/d) subnetworks. 
However, if each C, is assigned to a particular (N/d X N/d) subnetwork, 
one of the (d X d) subnetworks, say the one at the upper-left corner 
of the input stage (Fig. 1), can be eliminated. This elimination implies 
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that the connection set C, , and hence P, , is assigned to the first 
(N/d X N/d) subnetwork in the middle stage if and only if re,,, = 1 
for some j, 1 ^ ; ^ N/d. In general, C, , and hence P, , is assigned to 
the A; th (N/d X N/d) subnetwork in the middle stage, 1 ^ k £ d, 
(counting from the top) if and only if x iti = k for some j, 1 ^ j ^ N/d. 
One may then reorder the indices i such that C, contains the input- 
output pair [ViJ'o) , and it follows that the permutations P x , P 2 , ■ • • , P d 
are similarly ordered for the d (N/d X N/d) subnetworks in the middle 
stage. 

3.2.2 To Obtain Permutations for Subnetworks in the Input and Output 
Stages 

Let P/,, , P/,2 , ■ • • , Pi. N/d and P .i , P , 2 , ■ • ■ , Po.N/d be the sets 
of permutations to be realized by the l 8t , 2 nd , • • • , (N/d) th (d X d) sub- 
networks in the input and output stage, respectively. Moreover, let 
the reducible connection set C, be written as 



Ci = 



1 ^ i ^ d 



[ir(x iA ) 7r(x,. 2 ) • • • ir(Xi, N /d)\ 

such that Xi t i < x,-_ 2 • ■ • < • • • < .r,-, N/d . 

Then, from the network structure, one can see simply that the 
permutations 



and 



Pr.i = 



Po. k = 





1 2 ••• d 

7T A (1) 7T t (2) ••• T k (d) 



can be obtained from C, with 

wj\a) = x a ,i - (j - l)d, 1 ^ j ^ N/d, 1 ^ a ^ d 

where T~ 1 (a) denotes the input terminal to be connected to the output 
terminal a, and ir k (a) = ir(x ail ) — (k — 1) d, 1 ^ k ^ N/d, for some 
/ such that ir(x a , t ) G J(k, d). 

3.2.3 An Example on Decomposing a Permutation 

Consider a (15 X 15) network, having a base d = 5 structure. Such 
a network is shown in Fig. 4, with two (5 X 5) subnetworks in the 
input stage, three (5 X 5) subnetworks in the output stage, and five 
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P,* = 



MIDDLE STAGE 

6 3 5 2 10 II I 4 13 9 15 7 8 14 I2 N 
1 4 6 15 9 12 II 2 3 8 10 7 5 13 14 ; 



Fid. 4— A (15 X 15) network with permutations assignment. 

(3x3) subnetworks in the middle stage. Let the connections through 
the network be described by the following permutation: 



P = 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 



,11 15 4 2 6 1 7 5 8 9 12 14 3 13 10/ 

The output integers are partitioned as follows: 

S t = {11,15,4,2,6}, 

5 2 - (1,7,5,8,9|, 

5 3 = |12,14,3,13,10|. 

From these, the reducible connection sets d , C 2 , C 3 , C 4 , and C 5 
are constructed as shown in Table I and, in general, they are not unique. 
The corresponding permutations (P,), which are also shown in Table I, 
are obtained from C, by replacing each element with its characteristic. 
Note that connection sets are ordered according to the mput integers 
(1, 2, 3, 4, 5) and that z,., < x,, 2 < »,-.i for each i. Tabl j II shows the 
permutations derived from C, for subnetworks in the input and output 
stages using the relations for rj l (p) and ir k (a) as given in Section 3.2.2. 
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Table I — Reduced Connection Sets and Their Corresponding 

Permutations 



l 


c, 


P, 


1 




fl 8 151 
11 5 10J 


C 23 ) 

\3 1 2/ 


2 




f 2 9 13-j 

.15 8 3j 


C 23 ) 

\3 2 1/ 


3 


r 3 io ir 

I 4 9 12, 




C 23 ) 

\1 2 3/ 


4 




[ 4 7 121 

.2 7 14J 


C 23 ) 

\1 2 3/ 


5 




[5 6 141 
6 1 13 




C 23 ) 

\2 1 3/ 



Table II — Permutations for Subnetworks in the Input and 

Output Stages 



3 


P/,y 


Po.y 


1 


[12 3 4 5] 
.12 3 4 5. 


/l 2 3 4 5\ 
\5 3 4 2 1/ 


2 


[3 4 5 2 11 
12 3 4 5. 


/l 2 3 4 5\ 
\5 3 4 2 1/ 


3 


p 3 1 2 41 
[l2345j 


/l 2 3 4 5\ 
\1 5 2 4 3/ 



IV. the control algorithm for the base-2 network 

The selection of output integers from the set 5/ , 1 ^ I ^ N/d, to 
form the connection sets Ci , 1 ^ 2 '. ^ d, is by no means simple. One 
procedure has been reported by V. I. Neiman. 10 By modifying the 
previous example, it can be shown that the selection can not be made 
on a strictly sequential basis. Let the given permutation be modified 
such that the sets Si , S 2 , and S :i arc as follows: 
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flf, = {11, 10,4,2,61, 

5 2 = [1,7,5,8,9}, 

5 3 = {12, 14,3, 13, 15). 

If one had chosen [,j] and [,] input-output pairs to form C, , where 
the output integers 11 and 1 are from S x and <S 2 respectively, there 
is no output integer in S 3 which would have a characteristic different 
from that of the integer 11 or 1. Thus, in general, simultaneous selections 
must be made in the construction of the connection sets. For networks 
with base-2 structure, however, the selection is reduced to a mere 
binary choice, resulting in a much simpler algorithm. For the other 
extreme case, 11 where d = N/2, the difficulty described above will 
not arise because there are only two sets <S, and S 2 , each containing 
exactly d integers. 

For a (N x N) network with a base-2 structure, the control 
algorithm for setting the ^-elements to realize a given permutation P 
consists of three parts : (?) decomposing of P into reducible connection 
sets Cx and C- 2 \ (n) reducing C, and C 2 to Pi and P 2 respectively; 
and (Hi) setting the /^-elements in the input and output stages. Since 
the network has an iterative structure, the same procedure is applied 
to each of the (N/2 X N/2), (AT/4 x N/4), •••,(2x2) subnet- 
works. There are log 2 -V levels of an (A 7 x N) network with the last 
level of (2 X 2) subnetworks being a trivial case, assuming A r is a 
power of 2. 

A coding scheme for the input and output integers which facilitates 
the required operations and two methods for decomposing P are 
described in the following sections. 

4.1 Coding Scheme 

It is clear from what has been discussed so far that the control 
algorithm essentially accepts the connection requirements as input 
data and, after processing, generates a set of output data which are 
used to rearrange the network. It is necessary, therefore, to have an 
input/output (I/O) memory, which stores the output terminal 7r(.r ( ) at 
the address determined by the numerical value of the input terminal x { . 
A simple coding scheme for these integers proves to simplify the 
implementation of the algorithm. 

Referring again to the network, one can, of course, use the set of 
integers (0, 1, 2, • • • , N — 1) to number the input and output ter- 
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minals, without loss of generality in all the previous discussions.* 
Then, the familiar binary code can be used directly, both for the 
input Xi as address and for the output ir(xi) as the contents at x t . We 
shall now show how this code can be used at all \og 2 N levels. Let the 
binary representation be 

6 n _ 1 6„_ 2 • • • bA , 

where n = \og>>N, and assuming N a power of 2. Beginning at the first 
level of the network, the least significant bit of the address is used to 
set the input /^-element defined by the remaining n — 1 bits, and that 
of the contents to set the output /^-element defined by the remain- 
ing n — 1 bits of the contents. Moreover, the conversion from C\ (or 
C 2 ) to Pi (or P 2 ) is accomplished by merely eliminating 6 for each 
coded output integer. Finally, for an output integer u-(.r,), the bit b» is 
set to identify whether the particular w(xt) belongs to Pi or P 2 . How- 
ever, for an input integer, the most significant bit of the address, 6„_i , 
indicates whether x { belongs to Pi or P-> . 

The same coding procedure is applied at each subnetwork, and the 
I/O memory is partitioned (part of algorithm) in the appropriate 
manner. In general, at the i ,h level of the network, i < \0g2N, the (i — 1) 
least significant bits of a word in memory define the particular sub- 
network of size N(2 1 ~ i ) , and the \ogoN — (i — 1) most significant bits 
define the output integer. The (i— 1) most significant bits, however, of 
the address designate the subnetwork, and the remaining bits define 
the input integer. An example will be given in detail to illustrate 
this in Section 4.2.1. 

4.2 Decomposition by Looping 

With d = 2, an integer set is reduced to an integer pair, consisting 
of only two elements, and one is said to be the dual of the other. If one 
continues to use the integers (0, 1, ■ • • , N — 1) to number the input 
and output terminals of an {N x A0 network, then the integers a and 
b constitute an integer pair if 



= I 
for some integer I. The dual of a (or b) is denoted by d (or 6), and, 



* Except that the definition of the integer set J(l, d) needs to be slightly modi- 
fied, as follows : 

J(l, d) = [a\[a/d] = l\, g / ^ (N/d) - 1 
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therefore, 



d = b and b = a. 



Moreover, any permutation is decomposed into only two reducible 
connection sets d and C 2 , and if [„*,,] E C, , then [ T f f) J and [ T *;,>] E 
C 2 for some z, , where tt(x,) is the dual of tt(x). In coded form, the dual 
is obtained merely by complementing the least significant bit. 

One method, called the looping procedure, of constructing C\ and C 2 
from P is to search the I/O memory for the required outputs. The 
sequence starts by selecting the output tt(0) in the first location of 
the memory (or [ T \ 0) ]). The first (n - 1) bits (a n _,a n _ 2 • • •■ a,) of the 
address which are all zeros define the first /3-element (/3 n ) in the input 
stage, and the last bit (a ) which is also zero defines j8 n to be set to the 
"straight through" state (see Fig. 5). The bits w?„_,??? n _ 2 ■ • • m, and 
m at this address define one of the /3-elements (/3 0I ) in the output stage 
and its setting respectively. Bit m is now reset to zero to designate 
that this particular output (»? n _ 1 ??? n _ 2 • • • m,) is for P, . There is also 
another bit m n which is set to one when that particular output has 
been placed in P t or P 2 so that an unused output can be selected when 
it is necessary. The example in Section 4.2.1 will clarify this. 

The memory is next scanned for the dual of x(0); this output and 
its address x k define [ r ' k xh y] for C 2 ■ For the input-output pairs in C 2 



N-1 
N 




2 2 

3 N/2 x N/2 3 
SUBNETWORK 



N/2 



N/2 



N/2 xN/2 
SUBNETWORK 

N/2 N/2 




N-1 

N 



- STRAIGHT THROUGH 



- CROSSOVER 



Fig. 5 — Base-2 (N X N) rearrangeable switching network. 
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it is only necessary to set m„ and m of the memory word to '1' to 
signify that this word (output) has been used and that it is for P 2 • 
The output ir{x k ) at the address £ k defines [ r % t) ) and is designated 
for C, . The same looping procedure is continued until all 7r(.r,), 
1 ^ i ^ N are assigned to either Ci or C 2 ■ In most cases, however, 
the looping will end before all output integers are used. The procedure 
is started again by arbitrarily selecting an unassigned output for Pi , 
by examining m n . Because of this arbitrariness, C, is, in general, not 
unique. This fact is used to advantage in the other procedure to be 
described in Section 4.3. 

After the looping procedure is completed, the memory is reorganized 
to have Pi and P 2 in locations to N/2 — 1 and N/2 to N — 1, respec- 
tively. (A small scratch pad memory may be necessary.) Then the same 
procedure is applied to each of these (N/2 X N/2) subnetworks, and 
it is continued until all of the /3-elements in the (N X N) network 
are set. 

The searching can be eliminated by employing two memories, one 
of which is the I/O memory. The additional one is an output/input 
(O/I) memory that stores the input ,r, at the address corresponding 
to the numerical value of the output t(x { ). The decomposition of P 
into Pi and P 2 is achieved by crisscrossing between the two memories. 
For example, output *■(&,) and its corresponding address x t in the 
I/O memory define [,<£,>] for d . Now the dual of r(z<), sayir(a?/), 
is the address for the O/I memory and the corresponding word .r, 
defines [ T z i n J for C 2 . Then i, is used for the address in the I/O memory. 
If the I/O memory is a content addressable memory (CAM), 12 the 
required r(Xi)'a for C, and C 2 are determined directly in the content 
addressable mode, without the use of a second memory. 

4.2.1 An Example of the Looping Procedure 

In order to illustrate in a meaningful way the looping procedure 
using the coding scheme at various levels, a permutation for a (32 X 
32) network, as given in Table III, will be utilized. 

The looping procedure begins with WO) = 14 and continues to 
select input-output pairs for the connection set C x ■ (The sequence of 
this selection is indicated by the number in the "looping sequence" 
column.) As each output integer is selected for Ci , the last bit m,) is 
used to set the output /^-element designated by (m 4 m 3 m 2 Wi) (see 
Fig. 6) . Then the bit wi (l is set to '0' to indicate that it is for Ci , and 
the bit m 5 is set to '1' to indicate that it has been used. The bits m n and 
m& of the output integers for C> are merely set to '1'. In this particular 
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Table ID 


— Memory Cox 


TENTS ox the 


Interconnections 


Input 


Outputs 




Coded Output Integers 


Looping 


Terminals 


Terminals 


m 5 


nii 7713 m.2 


mi mo 


Sequence 





14 





1 1 


1 


1 


1 


23 





1 1 


1 1 




2 


3 








1 1 


7 


3 


20 





1 1 







4 


28 





1 1 1 





8 


5 


2 








1 




6 


16 





1 







7 


11 





1 


1 1 




8 


31 





1 1 1 


1 1 


9 


9 


29 





1 1 1 


1 




10 


1 








1 




11 


27 





1 1 


1 1 


5 


12 


12 





1 1 





3 


13 


18 





1 


1 




14 


5 





1 


1 




15 


24 





1 1 





11 


16 


26 





1 1 


1 




17 


21 





1 1 


l 


6 


18 














4 


19 


13 





1 1 


1 




20 


8 





10 







21 


6 





1 


1 


13 


22 


19 





1 


1 1 


2 


23 


15 





1 1 


1 1 




24 


4 





1 





10 


25 


30 





1 1 1 


1 




26 


7 





1 


1 1 




27 


22 





1 1 


1 


14 


28 


17 





1 


1 




29 


10 





1 


1 




30 


25 





1 1 


1 




31 


9 





10 


1 


12 



example, the looping procedure ends prematurely and leaves the con- 
nection set 

[6 7 28 29 
16 11 17 10 

as indicated on Table IV. 

One then repeats the looping procedure by starting arbitrarily at 
some remaining output integers (as indicated by m-, = 0) . After every 
output integer has been assigned to G (or C 2 ), one rearranges the 
memory such that all the integers with m Q = occupy the upper half 
of the memory, corresponding to the connections through the upper 
network, with the remaining output integers for the lower network. This 
is shown in Table V. 
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Fig. 6 — A (32 X 32) rearrangeable network with partial setting of /S-elements. 

Table VI shows the memory contents after another looping pro- 
cedure is applied to the output integers (for both the upper and lower 
networks) and subsequent rearranging. At this level, m^n^ denotes the 
(8 X 8) subnetwork, and if one reverses the order to m mi , it is just 
the binary representation of natural ascending numbers, being the 
upmost (8 X 8) subnetwork and 3 being the lowest (8 X 8) subnet- 
work. One could also obtain the same information from the addresses, 
since the contents are rearranged into this order. 

4.3 Decomposition ivith a Trial-Partition Procedure 

A second method for decomposing P which incorporates a trial-and- 
error procedure is presented. Although this method is practical only 
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for small networks, it is very important because of the simple imple- 
mentation, and intelligence can he included to reduce the average 
processing time by taking advantage of the fact that the decomposi- 
tion may not be unique. 

From the derivation of the reducible connection sets C x and C 2 , 
it is seen that they must contain one and only one r(x t ) from each 
S, ,0 ^ Z < (N/2) - 1, and the corresponding input :r, . Since [„° 0) J and 
[.{„] are defined to be in C, and C 2 , respectively, only (N/2 - 1) 
additional input-output pairs must be selected for C, ; the remaining 
pairs are for C 2 . After C, and C 2 are determined, P, and P 2 and the 
rearranging of the I/O memory are derived in the same manner as 
in the looping procedure. Let ^ be the set of connection sets; each 
includes (N/2 — 1) input-output pairs formed by having one t(x,) from 

Table IV— Memory Contents After Sequencing Through One Loop 



m b 


mt 


Wl3 


1)1 2 
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Table V — Memory Contents After Rearranging 
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each Si , I > 0. Since there are two elements in each S t , * consists of 
2 W "" 1 connection sets. For any arbitrarily selected C £ ^, the test 
will be only on the output integers. Therefore, C defines C\ if and onty 
if for any 7r(x.) £ C, its dual is not in C. 

These ideas lead to the application of a finite state machine with 
2<w/2)-i s t a t e s which are used to generate *. This machine, called the 
Trial-Partition Machine (TPM), is composed of {N/2 — 1) two-state 
storage devices (flip-flops), each of which represent an input integer 
pair. The "0" or "1" state of the flip-flop designates that the odd 
or even input, respectively, of the input integer pair and the corre- 
sponding output is in C. If <r, is a state of the TPM, then it defines 
C, and the r(g<) £ C are tested for an integer pair (two outputs with 
the same characteristic). If C contains no pairs, then it is reducible 
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and can be used as C\ . However, if there is at least one output integer 
pair in C, the TPM is advanced to a i+1 . Since the outputs are serially 
stored in the I/O memory, the memory must be sequenced in order to 
perform the test for each o-, . 

This type of a TPM can be easily implemented with a (N/2 — 1) 
bit binary counter. However, it may be desired to have a more intelligent 
machine so that C x can be determined in less time (with fewer trials). 
Of course, the complexity and, consequently, the cost of the TPM 
will increase as the intelligence of the machine grows. One way to 



Table VI — Memory Contents at the Third Level (after two 
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enhance the intelligence is to count the number of output integer 
pairs in C. This number (Z) specifies how many [ „.*£,, J must be changed 
in order to have all w(Xi) from different output integer pairs. Then the 
next set (C) is selected from those sets which contain at least Z different 
pairs and have not been previously tested. Another method is to observe 
the input-output pairs [,*£<)] and [,**,-) J that form an output integer 
pair, and select a C that contains either x t or x, . For an example of 
a TPM, see Ref. 11. 

The TPM is applicable only for small networks because the number 
of tests becomes prohibitive as N increases. There are 2 (AV2)_1 con- 
nection sets in * and, on the average, half of these must be tried. A 
TPM with intelligence, however, will reduce the number of sets to 
be tested, but for networks larger than (64 X 64) this will still be too 
time consuming. 

4.4 Using Combinational Logic 

The control algorithm for the (4 X 4) network can be imple- 
mented with combinational logic. This is achieved by assigning the 
states of the five ^-elements for each of the 24 possible input-output 
permutations. The combinational logic, which consists of 13 nand 
gates, determines the correct setting from the outputs which are 
coded in the binary code. This method is practical only for very small 
networks because the number of permutations grows very rapidly. The 
(4 X 4) network is important, however, because it could be used as 
a building block to construct larger networks, and the combinational 
logic and the /^-elements could be on the same semiconductor chip. 
This same idea also applies to an (8 X 8) or (16 X 16) network using 
the TPM. 

4.5 System Description 

The block diagram of a rearrangeable switching system with only 
one memory (I/O Memory) is shown in Fig. 7. The Network Control 
realizes the control algorithm by employing one or more of the above 
methods. For example, it may be advantageous to use the combina- 
tional logic and TPM for the small networks (or subnetworks) because 
they are relatively inexpensive to implement. However, as the size of 
the network grows, the processing time becomes critical. Then the 
looping procedure with one and two memories (or a content address- 
able memory) may be used for less than 1000 terminals and more than 
1000 terminals, respectively. The timing considerations for these 
methods are discussed in the next section. 
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Fig. 7 — System block diagram. 

The Scratch Pad Memory temporarily stores the outputs during 
decomposition of P and also while the I/O Memory is being par- 
titioned. The System Control generates the timing and control 
sequences for all of the operations. This unit, as well as the Network 
Control, could be implemented with stored program techniques if it 
is economical and if there is sufficient real time. 

The other important unit, called the Network Buffer, isolates the 
network from the Network Control so that the existing traffic will not 
be affected during the processing time of the control algorithm. As the 
settings of the /3-elements are determined, they are stored in the 
Network Buffer. After the control algorithm is completed, the states 
of the /3-elements are set within the time required to insure the quality 
of the transmission. This time is dictated by the switching transients 
of the /3-elements, network terminations, and the application. Serial 
shift-registers can provide economical buffers, if the /3-elements can 
be set in a stage-by-stage sequence. 



V. TIMING CONSIDERATIONS 

In this section, the processing time and the necessary equipment for 
the various methods of implementing the control algorithm are dis- 
cussed. The processing time for the algorithm must be sufficiently short 
to accommodate the traffic changes. The combinational logic method 
is the fastest; however, it is applicable only for very small networks. 
The TPM has the next order of complexity, and it is applicable for 
(64 X 64) networks or smaller. The processing time, which increases 
exponentially, is derived as follows: For an {N X N) network, there 
are 2'- v/2 '- 1 states of the TPM. If the TPM is a binary counter (no 
intelligence), on the average about half of the states or 2 ,A ' / - )_2 must 
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be tested before the given P is decomposed to C-i and C-> . Also, on the 
average, one-half of the I/O memory is scanned before an output 
integer pair is detected. Therefore, the number {At) of times that the 
I/O memory is accessed for logJV levels [the logJV ,h level is the 
trivial (2x2) network, and the number of times the memory is ac- 
cessed is simply N/2] is: 

XT log,iV-l «r 

4»-f E 2»'" + f- (1) 

The processing time can be greatly reduced by using the looping 
procedure. If a random-access I/O memory is employed, it is necessary 
to search for one-half of the outputs (the remaining outputs are 
obtained directly from memory) on each level of the network. For an 
(N X N) network, N/2 outputs are determined by accessing the I/O 
memory, on the average, N/2 times. Then 

log s Af / 1 \ i + 1 

A T = N 2 Z (j) = WN - 1). (2) 

In addition, the access time required to partition the memory is 2N 
log 2 iV and should be included in equations (1) and (2) to give the 
total processing time. 

If two memories (in the crisscross manner) or a CAM is utilized, 
no searching is necessary, and access times for the decomposition of P 
and partitioning of memories are: 

A T = W \og 2 N 
and (3) 

A T = 3A r log 2 N respectively. 

For N = 16,384 and a CAM with 1-fisec access time, the control 
algorithm, implemented with wired logic and a 10-MHz basic clock, 
can be accomplished in approximately 750 msec. If two memories 
(random access) with 1-fj.sec cycle time are used, a processor with 
an instruction execution time of 3 /xsec can implement the control 
algorithm in approximately 50 seconds. 

VI. CONCLUSION 

In this paper, the network structure and control algorithm for cer- 
tain (N X N) rearrangeable switching networks are described. The 
algorithm consists of decomposing a given permutation into d (where 
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d is the base of the network) permutations for the d (N/d X N/d) 
subnetworks, and determining the connections for the N/d (d X d) 
networks in the input and output stages. The same procedure is 
applied to the subnetworks in an iterative manner until all of the 
connections in the (N X N) network are defined. 

Although the network can be constructed with various building 
blocks (bases), the base-2 structure is the most important because it 
requires the least number of two-state devices (^-elements) and the 
control algorithm is relatively simple. The algorithm is implemented 
by performing the decomposition either by the looping procedure or 
by a Trial-Partition Machine. Also, an efficient coding scheme is de- 
fined to facilitate the decomposition of the permutation and the par- 
titioning of the memory. With the base-2 structure, the control 
algorithm and the coding scheme can be used in a consistent manner 
at each level of the network. There are other classes of network struc- 
tures with the same number of /3-elements, such as the nested-tree net- 
works, that do not have this property. 

The processing time and equipment complexity vary with the 
methods of implementation. The combinational logic is the fastest and 
least expensive; however, it is only applicable for very small networks. 
The Trial-Partition Machine is economical, but it is too slow for large 
networks; however, intelligence can be designed into the machine 
taking advantage of the fact that for a large number of permutations 
there are more ways than one of setting the network. Consequently, 
the processing time is dependent on the permutations given, as well as 
the amount of intelligence built in. The most suitable method for 
networks larger than (64 X 64) is the looping procedure with a con- 
tent addressable memory to store the outputs. The processing time is 
independent of the permutations given. With this method, it is pos- 
sible to determine the setting of all the ^-elements for a (16,384 X 
16,384) network in less than one second for any number of new con- 
nections or terminations. During the processing time, the new settings 
for ^-elements are stored in a buffer; then their states are changed. 
The memory required for this system is about 300k bits or approxi- 
mately 20 bits per input terminal. 

This paper has described a control algorithm for a rearrangeable 
switching network that is practical from both the system and process- 
ing time viewpoints. The application of this network should be con- 
sidered where full access and nonblocking is required, and rerouting 
is possible. 
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